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Abstract. Motivated by the sequence form formulation of Koller et al. Ifl9l, this paper 
defines bilinear games, and proposes efficient algorithms for its rank based subclasses. Bi- 
ryy ■ linear games are two-player non-cooperative single-shot games with compact polytopal 

qq . strategy sets and two payoff matrices (A, B) such that when (x, y) is the played strategy 

CN ! profile, the payoffs of the players are x 1 Ay and x T By respectively. We show that bilin- 

ear games are very general and capture many interesting classes of games like bimatrix 
f— I ■ games, two player Bayesian games, polymatrix games, two-player extensive form games 

with perfect recall etc. as special cases, and hence are hard to solve in general. 
5/5 | Existence of a (symmetric) Nash equilibrium for (symmetric) bilinear games follow 

directly from the known results. For a given bilinear game, we define its Best Response 
Polytopies (BRPs) and characterize the set of Nash equilibria as fully-labeled pairs in the 
BRPs. We consider a rank based hierarchy of bilinear games, where rank of a game {A, B) 
■ is defined as rank(A + B) . In this paper, we give polynomial time algorithms to compute 

OO . Nash equilibrium for special classes of bilinear games: 

- Rank-1 games (i.e., rank(A + B) = 1). 

- FPTAS for constant rank games (i.e., rank(A + B) is constant). 

- When rank(A) or rank(B) is constant. This improves the results by Lipton et al. Il22| 
r— I | and Kannan et al. [17], for bimatrix games with low rank matrices. 

> '■ 

j^-j ■ 1 Introduction 

In the last decade, there has been much research at the interface of computer science and game 



theory (see e.g. B24I26|1 ). One fundamental class of computational problems in game theory is the 
computation of solution concepts of finite games. Nash 1(23]) proved that in any finite game there 
always exists a steady state, from where no player gains by unilateral deviation. Such a steady state 
has since been named Nash equilibrium (NE) and is perhaps the most well-known and well-studied 
game-theoretic solution concept. However, computing a Nash equilibrium is nontrivial, and indeed 
the recent series of papers [41711111 established that the problem is PPAD-complete for finite games 
in the standard normal form representation, even for games with only two players. Furthermore it 
is PPAD-complete to even find a -J^j -approximate Nash equilibrium 0. In light of these negative 
results, one direction is to identify subclasses of games for which the problem is tractable. 

A two-player normal form game can be represented by two payoff matrices, say A and B, one 
for each player, and hence is also known as bimatrix game [24 J. For bimatrix games, polynomial 
time NE computation algorithms are known for many subclasses, including zero-sum games [6], 
(quasi-) concave games [20J, and games with low rank payoff matrices |22| . A line of work focuses 
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on games of small rank, defined as rank(A + B) by Kannan and Theobald Ifl7l| . They gave a fully 
polynomial time approximation scheme (FPTAS) for fixed rank games and recently Adsul et al. 
|ffll gave a polynomial time algorithm for computing an exact Nash equilibrium for rank-1 games. 
Specifying the two payoff matrices of a bimatrix game requires a polynomial number of entries 
in the numbers of pure strategies available to the players. This is adequate when the set of pure 
strategies are explicitly given. However, there are situations where the natural description gives 
the set of pure strategies implicitly and as a result they may be exponential in the description of 
the game. For example, normal form (bimatrix) representation of two player extensive-form game 
may have exponentially many strategies in the size of the extensive-form description [9 J. In such a 
case, even if the resulting bimatrix game has a fixed rank, the above results may not be applied for 
efficient computation. 

Nevertheless, certain types of extensive-form games have some combinatorial structure which 
can be exploited. Koller, Megiddo and von Stengel |T9| converted an arbitrary two-player, perfect- 
recall, extensive form game into a payoff-equivalent two-player game with continuous strategy sets. 
In this derived formulation, which they call the sequence form, there is a pair of payoff matrices A 
and B, one for each player. Further, their strategy sets turn out to be compact polytopes in Euclidean 
space of polynomial dimension. Given a pair of strategies (x,y), utilities of the players are x T Ay 
and x T By respectively. Interestingly, the sequence form requires only a polynomial number of bits 
to specify. 

Motivated by the sequence form of Koller et al, we define bilinear games, which are two-player, 
non-cooperative, single shot games represented by two payoff matrices, say A and B, of dimen- 
sion M x N and two polytopal compact strategy sets X = {x £ R M | Ex = e, x > 0} and 
Y = {y G R N | Fy = /, y > 0}. If [x,y) e X x Y is the played strategy profile, then x 1 Ay and 
x T By are the utilities derived by player one and player two respectively. In other words, the pay- 
offs are bilinear functions of strategies, hence the name bilinear games. The scope of bilinear games 
is large enough to capture many interesting classes of games besides two-player extensive form 
games with perfect recall. For example, for two-player Bayesian games I15l25ff , polymatrix games 
HU, an d various classes of optimization duels [16]) , researchers have proposed polynomial-sized 
payoff-equivalent formulations which (either explicitly or implicitly) turn out to be bilinear games 
(see Section |2T1 for details). Intuitively the polytopal strategy sets are concise representations of the 
original sets of mixed strategies as marginal probabilities, and x T Ay and x T By express the expected 
utilities of the original game in terms of these marginal probabilities. 

Remark 1. Note that our formulation can express arbitrary polytopes as strategy space, for example 
if the strategy set of a player is expressed as {x : Gx < g}, i.e., the intersection of a set of half-spaces, 
it can be transformed to an equivalent game with strategy set {x' : Ex 1 = e,x' > 0} using standard 
techniques (i.e., by adding slack variables, substituting unbounded X{ with xf — , x+, xj > 0, and 
modifying the payoff matrices accordingly). 

As we have seen that many different types of games can be concisely described as bilinear games, 
designing efficient algorithms for the more general bilinear games seem to be important as well as 
challenging. Since bimatrix games is a subclass of bilinear games (see Section 12.1) 1. all the hardness 
results of bimatrix games automatically apply to bilinear games as well. Therefore, the only hope 
is to design efficient algorithms or FPTAS for the special subclasses. There are many similarities 
between bilinear and bimatrix games, for example, payoffs are represented by two matrices, and 
utilities are bilinear functions of strategy vectors, hence it is natural to try to adapt algorithms for 
bimatrix games to bilinear games. However, a technical challenge is that the polytopal strategy sets 
of bilinear games are generally much more complex than the bimatrix case; in particular the number 
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of vertices may be exponential, while the set of mixed strategies is just a simplex. Recently, we were 
pointed to the constrained games, similar to bilinear games, considered by Charnes |3J. The lin- 
ear programming technique by Charnes H3j also works for zero-sum bilinear games, i.e., those with 
{A + B) = (see also [18] for sequence form, [16j for another derivation, [25J for zero-sum Bayesian 
games and [8] for zero-sum polymatrix games). Further, it is easy to show that the linear comple- 
mentarity program (LCP) characterization for the set of NE of a sequence form game [19J works for 
bilinear games as well. However the Lemke-Howson algorithm may not be directly applied to the 
general bilinear games. There are results for certain subclasses of bilinear games with specific struc- 
ture in their strategy sets, e.g., Howson and Rosenthal |15| adapted the Lemke-Howson algorithm 
to two-player Bayesian games and Koller et fl/. |T9| adapted Lemke's algorithm [21 J to two-player 
extensive-form games. 

Our Contribution. In Section^ we define the bilinear game and show that the existence of a (sym- 
metric) Nash equilibrium in a (symmetric) bilinear game directly follows from the known results. 
Note that given a strategy of a player, the other player would like to play a utility maximizing strat- 
egy. We formulate this problem as a primal-dual LP, similar to the Koller et al. formulation. Using 
the complementarity conditions of the primal-dual, we characterize the Nash equilibria and then 
define Best Response Polytopes (BRPs) and the notion of fully-labeled pairs in BRPs. Further, we show 
one-to-one correspondence between the Nash equilibria and fully-labeled pairs. This in turn gives a 
quadratic programming (QP) formulation for the NE computation problem. 

Next, we extend Kannan and Theobald's [17J rank-based hierarchy for bimatrix games to bilinear 
games, by defining the rank of a bilinear game with payoff matrices (A,B) as the rank of (A + 
B). Zero-sum games are rank-0 games and a NE for these games can be computed efficiently, as 
discussed above. In Section [3[ we show that in spite of a very general structure of the strategy sets 
in bilinear games, the basic approach given by Adsul et al. (T) to compute a NE of a rank-1 bimatrix 
game, can be generalized to compute a NE of a rank-1 bilinear game by solving a rank-1 QP. While 
solving a rank-1 general QP is NP-hard |12|, those arising from the bilinear games can be solved in 
polynomial time. In Section [4[ we discuss two FPTAS algorithms for the fixed rank bilinear games, 
which are generalization of the algorithms by Kannan and Theobald [17| for the bimatrix games. 
Finally, in Section[5l using the structure of BRPs, we obtain a polynomial time algorithm for the case 
when the rank of either A or B is a constant, and rank of E and F are also constant. Since, a bimatrix 
game can be thought of as a bilinear game with E and F being a single row of Is (see Example [TJ 
of Section I2.1[) , this algorithm improves upon a result by Lipton et al. |22| and Kannan et al. IfTTll 
for bimatrix games, where they require the rank of both A and B to be constant. This approach also 
gives an enumeration algorithm for extreme equilibria, which runs in polynomial time under the 
above assumption and exponential time for the general bilinear games. 

The following table summarizes all the NE computation results while keeping the bimatrix 
games in perspective. 



Results 


Bimatrix games 


Bilinear games 


Existence of (symmetric) NE 


Nash [23] 


Easy to show using ([23]) [10] 


LCP formulation for NE 


Known [24J 


Koller et al. [19] 


NE as fully-labeled pairs of BRPs 


Known [27] 


This paper 


Zero-sum games 


Linear programming [6] 


Linear programming [18] 


Rank-1 games 


Adsul et al. HI 


This paper 


FPTAS for fixed rank games 


Kannan and Theobald II17II 


This paper 


Games with low rank matrices 


Lipton et al. [22| 


This paper 
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2 Bilinear Games and Nash Equilibria 

Notations. We consider a vector x as a column vector by default and for the row vector, we use 
transpose (i.e., x T ). A "0" in the block representation of a matrix, is the matrix with all zero entries 
of appropriate dimension, and "1^" is a vector of all Is of length k. Let x G K," be a vector and 
c G R be a scalar, then by x < c we mean, Vz < n, x, < c. For a given matrix X, X, denotes the 
i row of X, X^ denotes the / column of X and |X| denotes the maximum absolute entry in X, 
i.e., |X| = max,-,- |X«|. For a set S, A(S) denotes the set of probability distribution vectors over the 

elements of S, i.e., A(S) = {x G E |s| | x > 0, Lfes x i = !}• 

Bilinear games are two-player non-cooperative, single shot games. A bilinear game is repre- 
sented by two MxN dimensional payoff matrices A and B, one for each player, and two compact 
polytopal strategy sets. Let Si = {1, . . . , M} be the set of rows and S2 = {1, . . . , N} be the set of 
columns of the matrices. Let E G R klxM and F G M k2xN be the matrices, and e G R* 1 and / G R h 
be the vectors. The strategy set of the first-player is X = {x G R M | Ex = e, x > 0} and the second- 
player is Y = {y G R N I Fy = f,y > 0}. Sets X and V are assumed to be compact. From a strategy 
profile (x,y) G X x Y, the payoffs obtained by the first and the second player are x T Ay and x T By 
respectively. 

From a Nash equilibrium (NE) strategy profile, no player gains by unilateral deviation. Formally, 



Definition 1. A strategy profile (x,y) G X x Y is a NE of the game (A, B) iff ' x 1 Ay > x' T Ay, Vx' G X 
and x T By > x T By', Vy' G Y. 

As a direct corollary of Glicksberg's [10] result that there always exists a Nash equilibrium in a 
game whose players' strategy spaces are convex and compact, and whose utility function for each 
player i is continuous in all players' strategies and quasi-concave in i's strategy, we have 

Proposition 1. Every bilinear game has at least one Nash equilibrium. 

A bilinear game is completely represented by a six-tuple (A,B,E,F ,e, f) in general. However, 
for ease of notation we represent it by ( A, B) fixing (E, F, e, f) . Given a strategy y G Y of the second- 
player, the objective of the first player is to play x G X such that x T (Ay) is maximized, i.e., solve the 
following linear program |Tl9| . 

max : x T (Ay) . T 

s.t. Ex = e 



x > 



min : e p 

^4 s.t. E p > Ay (1) 



Note that p, is the dual variable of the equation E,x = e, in the above program. At the optimal 
point (x, p) of (fT), we get x, > => A,y = p T E', Vz G Si from the complementarity. A similar 
condition can be obtained for the second-player, given an x G X. At a Nash equilibrium both the 
conditions are satisfied, and these characterize the NE strategies as follows: A strategy pair (x,y) G 
X x Y is a Nash equilibrium of the game (A, B) iff it satisfies the following conditions. 



3p G R kl s.t. Ay < E T p and Vz G Si, x ; - > => A { y = p T E i 
3q G K^ 2 s.t. x T B < q T F and V; G S 2 , t/ ; > => x T B' = q T Fi 



The above characterization implies that, a player plays a strategy with non-zero probability only 
if it gives the maximum payoff with respect to (w.r.t.) the opponent's strategy in some sense. Such 
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strategies are called the best response strategies (w.r.t. the opponent's strategy). Using this fact, we 
define best response polytopes (BRPs), similar to the best response polytopes of a bimatrix game p7| . 
In the following expression, x, y, p and q are vector variables. 



P = { (y, p) G R N+kl | A,y - p T E i < 0, Vz G S x ; y f > 0, V; G S 2 ; Fy = /} 
Q = { ( X/ q) G R M+,C2 | Xi > 0, Vz G Si; x T B> - q T F' < 0, V; G S 2 ; Ex = e} 



(3) 



The polytope P in 10) is closely related to the best response strategies of the first-player for any 
given strategy of the second-player and it is called the best response polytope of the first-player. 
Similarly Q is called the best response polytope of the second-player. Note that, in both the polytopes 
the first set of inequalities corresponds to the first-player, and the second set corresponds to the 
second-player. Since | Si | = M and | S 2 1 = N, let the inequalities be numbered from 1 to M, and M + 1 
to M + N in both the polytopes. Let the label L(v) of a point v in the polytope be the set of indices of 
the tight inequalities at v. If a pair (v, w) G P x Q is such that L(v) U L(w) = {1, . . . , M + N}, then 
it is called a fully-labeled pair. The proof of the next lemma follows using iQ. 

Lemma 1. A strategy profile (x,y) is a NE of the game (A,B) iff ((y, p), (x,q)) G P x Q is a fully-labeled 
pair, for some p and q. 

A game is called non-degenerate if both the polytopes are non-degenerate. Note that a fully- 
labeled pair of a non-degenerate game has to be a vertex-pair. Lemma [T] implies that a NE strategy 
profile has to satisfy the following linear complementarity conditions (LCP) over P x Q |T9| . 

((y, p), (x,q)) G P x Q corresponds to a NE x T (Ay — E T p) = and 

(x T B-q T F)y = (4) 

Clearly, x T (Ay - E T p) < and (x T B - q T F)y < over P x Q and hence x T (Ay - E T p) + (x T B - 
q T F)y < 0. Simplifying the expression using Ex = e and Fy = f we get x T (A + B)y — e T p — f T q < 
over P x Q and equality holds iff (x,y) is a NE (using (HJl). This gives the following QP formulation 
which captures all the NE of game (A, B) at its optimal points. 

max: x T (A + B)y — e T p — f T q . . 

s.t. ((y,p),(x,q)) GPxQ ( j 

Symmetric Bilinear Games. Nash [23J proved that any symmetric finite game has a symmetric Nash 
equilibrium. The concept of symmetry can be straightforwardly adapted to the bilinear games: We 
say a bilinear game is symmetric if B = A T , E = F, and e = f. A strategy profile (x,y) is symmetric 
if x = y. A straightforward adaptation of Nash's [23J proof yields the following proposition. 

Proposition 2. Any symmetric bilinear game has a symmetric NE. 

Note that for a symmetric game (A, A T ), strategy sets X and Y are the same. From (0 and (JU), it 
is clear that a symmetric NE x G X must satisfy q = p, Ax < E T p, x T (Ax — E T p) = 0. This gives 
the following QP formulation to capture all symmetric NE of a symmetric game (A, E, e). 

max: x T Ax — e T p . . 

s.t. Ax < E T p; Ex = e; x > ( ' 

A bilinear game {A, B, E, F , e,f) can be converted to an equivalent symmetric game (A', E' , e'), 
where 



A' 



A 
B T 



E' 



E 
F 



e' 



, with strategy vector z 



5 



It is easy to check that any symmetric NE of the derived game corresponds to a NE of the original 
game and vice-versa. In the next section, we discuss reductions of different games to polynomial size 
bilinear games, which do not seem to be possible with the bimatrix games. 

2.1 Examples of Bilinear Games 

The simplest subclass of bilinear games is the set of two-player normal-form games (bimatrix games). 
Example 1. Bimatrix Games 

A bimatrix game (A,B) with A,B 6 R MxN can be straightforwardly transformed to the bilinear 
game (A, B, E, F, e,f) where E T = 1m, e = 1 and similarly F T = 1^, f = 1. □ 

Many other interesting classes of finite games may be formulated as bilinear games. In this sec- 
tion, we provide a few examples where the bilinear formulation are exponentially smaller than a 
direct bimatrix formulation. 

Example 2. Two-player Bayesian Games 

In a Bayesian game lTl3| , there is a type set associated with each player, which is her private infor- 
mation. The nature draws the type for each player from a joint distribution, which is a common 
knowledge, and each player gets to know only her own type before choosing an action. The final 
utilities of the game is determined by types of all the players, and hence are uncertain. 

Here we consider the two-player case, where T;s are the type sets and S,s are the strategy sets. 
The joint probability distribution is denoted by pt s for the type profile (f,s) G T\ x Ti- Let S = 
Si x S2, T = Ti x T2, I T;| = tj and |S;| = m,-. The utilities are the functions of actions and types, i.e., 
Uj : S x T — >■ R, hence for every type profile they can be represented by the two matrices. For a type 
profile (£, s) let A ts and B ts denote the respective m\ x m^ dimensional payoff matrices. The strategy 
of a player is to decide her play for each of her type so that her expected payoff is maximized, i.e., 
x : Ti — > A(Si) for player one and y : Tj — ► A(S2) for player two. For a t G T\, let x f denote the 
mixed strategy given type t. 

The induced normal form of this game is a bimatrix game in which each pure strategy of a player 
prescribes an action for each of her types. Thus the size of the induced normal form is exponential in 
the number of types. However, it can be formulated as a polynomial sized bilinear game as follows 





' VnA 11 ■ 


■ pit 2 A lt2 ~ 




' PnB 11 • 


■ Vit 2 B lh ~ 




"l T • • •" 




"l T • • •" 


A = 






,B = 






,E = 


1 T 
u ± m 


,F = 


1 T 




_P h iA^ ■ 


■ Vhh Ahh _ 




_p hl B^ ■ 


■ VhhB hh _ 











and e = 1^, / = lj 2 . Given mixed strategies x 1 , . . . ,x tl ,y 1 , . . . ,y t2 of the Bayesian game, define 
x = [x lT , ■ ■ ■ , x^] T and y = [y lT , • • • ,y f 2 ] T . Then x T Ay and x T By are exactly the expected utilities 
of the Bayesian game. This transformation is implicit in Howson and Rosenthal's |15] adaptation of 
Lemke-Howson algorithm to two-player Bayesian games. □ 

Example 3. Polymatrix Games /O 

A polymatrix game is an n-p layer game in which each player's utility is the sum of the utilities 
resulting from her bilateral interactions with each of the n — 1 other players. Let S, be player z's set 
of pure strategies. The game is represented by the payoff matrices A'i G bJ s 'I x I s ;I for all pairs of 
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players Letx' G A(S,) denote a mixed strategy of player i. Given a strategy profile (x 1 ,. . .,x n ), 
the expected utility of player i is, 

u i (x\...,x n ) = Y^(x i ) T ^ ij x j 

m 

We show that any polymatrix game can be transformed to a symmetric bilinear game such that 
any symmetric NE of the bilinear game corresponds to a NE of the polymatrix game. Our derivation 
is adapted from Howson's fl4| formulation of NE of polymatrix games as solutions of an LCP. 
Formally given a polymatrix game, we define the induced symmetric bilinear game as (A, A T , E, E, e, e), 
where 



A 



A 12 
A 21 

A n\ A n2 



A ln 
A 2n 







L \Si\ 

1 T 



s 2 \ 







n l lT 








If 



and e = l n . The space of strategy vectors is x = [x lT ■ ■ ■ x 

Proposition 3. Consider a polymatrix game ofn players. The strategy (x 1 , x 2 , . . . , x n ) is an NE of the game 
if and only if (x, x) is a symmetric NE of its induced bilinear game. 

Proof. The proof is relatively straightforward, by observing that the respective incentive constraints 
are equivalent. Thus the problem of finding a NE of a polymatrix game reduces to the problem of 
finding a symmetric NE of a symmetric bilinear game. Note that an asymmetric NE doesn't corre- 
spond to a NE of the polymatrix game. □ 

Immorlica et al. |16!1 analyzed several classes of games between two optimization algorithms 
whose objectives are to outperform each other. The space of pure strategies are the possible outputs 
of the algorithm which are exponential, however the authors were able to formulate some of these 
games as zero-sum bilinear games (which they call bilinear duels). We describe one example from 

EH. 



Example 4. Ranking Duels nT6V 

Each player chooses a ranking over m elements. Thus the number of pure strategies is exponential in 
m. Such a ranking can be represented as am x m permutation matrix. By the Birkoff-von Neumann 
theorem, the space of mixed strategies corresponds to the space ofmxm doubly-stochastic matrices 
with each row and each column sum to 1. This space can be described by the polytope {x G M!" 2 | x > 
0; V/, YLj x ij = 1/ V/, Ya x ij = !}• Viewing x and y as column vectors, the sizes of the corresponding 



E, e are polynomial in m. Immorlica et al. [16] constructed matrices A G R" 
expected utilities are equal to x 1 Ay and — x T Ay respectively. 



such that the players' 
□ 



Example 5. Two-player Perfect-recall Extensive-form Games 

Extensive-form represents a dynamic game as a tree ||26| , where every pure strategy of a player 
prescribes a move at each of the player's information sets. As a result the number of pure strategies 
may be exponential in the size of the extensive-form description. Fortunately, if we assume perfect 
recall — roughly, that each player remembers all her past decisions and observations — then there 
always exists a Nash equilibrium in behavior strategies, where each player independently chooses 
a distribution over actions at each of her information sets. Representation of a behavior strategy 
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requires space linear in the extensive form. However, the expected utilities of the two-player perfect- 
recall extensive-form games are not bilinear functions of the behavior strategies. Koller et al. |19| 
proposed the sequence form, which is a bilinear game formulation for these games. The number of 
rows and columns of the matrices A and B, in the bilinear form, are the number of feasible sequences 
of plays of the first and second player respectively. If a play sequence pair leads to a leaf node 
then the ij™ entry of A and B are the payoffs of the first and second player at that leaf node, otherwise 
it is zeroQ A strategy x of the first player is such that, x(root) = 1, and if a sequence a ends at an 
information node C, then YlaeActionsfc) x ^a — x a = 0. Similar conditions hold for a strategy y of the 
second player. Such a strategy may be transformed to a behavior strategy and vice versa. This gives 
E, F, e and /. Note that the reduction is polynomial sized and x T Ay and x T By are exactly the expected 
payoffs under the corresponding behavior strategies. We refer readers to [19] for more details. □ 

Remark 2. Lemke's algorithm on the LCP formulation of bilinear games terminates with a solution 
(i.e., not at a ray) if the only non-negative solutions x and y to Ex = and Fy = are x = and 
y = 0, and the payoff matrices are non-positive, i.e., A < and B < 0. This result directly follows 
from [19 J. Note that games of all the above examples satisfy these requirements, without loss of 
generality. 

The rank of a game (A, B) is defined as rank(A + B), and we consider the rank based hierarchy 
of the bilinear games. The set of rank-A; games consists of all (A, B) such that rank(A + B) < k. 
Zero-sum games are rank-0 games, the smallest set in the hierarchy. Koller et al. Ifl8~t gave an LP for- 
mulation for zero-sum bilinear games, derived from two-player extensive form games with perfect 
recall. However, their formulation works for general bilinear games as well. Beyond rank-0 games, 
no polynomial time algorithm is known for NE computation (even for the reduction specific formu- 
lations). In the next section, we extend the polynomial time solvability of Nash equilibrium for the 
rank-1 bilinear games. 

For all the algorithms that follow, we make the following assumptions (without loss of general- 
ity): 1) The entries of A, B, E, F, e and / are integers, since scaling them by a positive number does 
not change the set of NE. 2) The equalities Ex = e and Fy = f are all linearly independent because 
even if we discard the dependent equalities, X and Y do not change. 3) The letter C denotes the bit 
length of the input game. 

3 Rank-1 Games and Polynomial Time Algorithm 

The approach used in this section is motivated by the paper [1J. Given a rank-1 game (A, B), it is 
easy to find a G M, M and B G R N such that A + B = a ■ B T , since any two rows of A + B are multiple 
of each other. In that case B = —A + a ■ B T . Let G(a) = (A, —A + a ■ B T ) be a parametrized game 
for a fixed A G R MxN and 8 G R N . For any game G(a) the BRP of first-player is fixed to P(«) = P 
(of (O) since A is fixed. However, the BRP of second-player Q(oc) changes with the parameter. Now, 
consider the following poly tope with x, q as vector variables and A as a scalar variable: 

Q' = { (x,A,q) G E M +!+ fc 2 | Xi > o, Vz G S 1 ; x T (-Ai) + XB ] - q T F! < 0, V; G S 2 ; Ex = e) (7) 

It is easy to see that Q(ot) is the projection of {(x, A.,q) G Q' \ A = x T oc} on (x, cj)-space. In other 
words, Q(oc) is a section of Q' obtained by hyper-plane A = x T oc. Clearly, Q' covers Q(a), G R M . 
Number the equations of Q' in a similar way as the equations of Q. Let Af be the set of fully-labeled 

* Due to chance moves, the entry may correspond to multiple leaf nodes. In that case the entry stores the expected payoff. 
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pairs of P x Q', i.e., Af = {(v,w) G P x Q' \ L(v) UL(w) = {1, . . . ,M + N}}. Using the definition of 
fully-labeled pairs, it is easy to check that for a given ((y, p), (x, Kq)) G P x Q', 

({y,p),(x,X,q)) e Af O x T (Ay-E T p) =0 and (x T (-A) + A J 6 T -^ T F)y = (8) 

Lemma 2. Let (v, w) G A/", v = (y, p) and w = (x, A, (j). 

- For all a such that A = x T a, (x,y) is a NE ofG(ot). 

- For every NE (x,y) of a game G(a), there exists a (v, w) G Af, where A = x T a. 

Proof. Since (v,w) is fully-labeled it satisfies, x T (Ay — E T p) = and (x T (— A) + AjS 7 — a T F)y = 0. 
Let a be such that A = x T a then we get (x T (-A) + (x T a)^ T - <? T F)y = => (x T (-A + a/i T ) - 
a T F)y = 0. This implies that (x,y) is a NE of the game (A, — A + a/3 T ) (i.e., G(a)), since it satisfies 
the complementarity condition of (JU). 

Given a (x,y) of G(a), from (O and ((U) it is clear that Bp, q such that x T (Ay — E T p) = and 
(x T (-A + a^ 7 ) - q T F)y = 0. Let A = x T oc, then we get (x T (-A) + A/3 7 - q T F)y = 0. Therefore, 
((y,p),(i,A,<j))G^. □ 

The above lemma establishes strong relation between the set of NE of all the G(a)s and the set 
AT. Next we discuss the structure of Af, and later use it to design a polynomial time algorithm to 
find a NE of a given game G(«). 

The polytopes P and Q' are assumed to be non-degenerate^, and let k\ = k2 = k for simplicity. 
As there are k linearly independent equalities in P and Q', they are of dimension N and M + 1 
respectively. Therefore, \/(v,zv) G Af, \L(v)\ < N and |L(w)| < M + 1. Since, M + N labels are 
required for a pair (v, w) to be part oiAf,Afc 1-skeleton of P x Q' . Further, if (v, w) G Af is a vertex 
pair then |L(r?) n L(w)| = 1. Let the label in the intersection be called the duplicate label of (v,w). 
Relaxing the inequality corresponding to the duplicate label at (v, w) in P and Q 1 respectively gives 
its two adjacent edges in Af. Therefore, every vertex of Af has degree two. This implies that Af is 
a set of cycles and infinite paths (unbounded edges at both the ends). We will show that Af forms 
a single infinite path. The next lemma follows directly from the definition of P 10 and Q' 10, and 
expression ©. 

Lemma 3. For all (v,w) = ((y,p),(x,A,q)) G Px Q', we have A(/3 T y) — e T p — f T q < 0, and the equality 
holds iff{v,w) G Af. 

Lemma|3]implies that Af is captured by A(/3 T y) — e T p — f T q = over P x Q'. Using this fact and 
Lemma 13 we define the following parametrized LP. 

LP(S) - max: 5(p T y) - e T p - f T q 
s.t. P x Q'; A = S 

For an a G FX, let OPT(a) be the set of optimal solutions of LP(a) and Af(a) be the set of points 
of Af with A = a, i.e., Af = {(v, w ) G Af \ w = (x,A,q) and A = a}. 

Lemma 4. For anaeR, Af(a) ^ and OPT(a) = AT {a) 

Proof. Consider a game G(a) where oc = [a, . . . ,a]. Clearly, for any Nash equilibrium (x,y) of G(a) 
the corresponding point in Af has A = x T cc = a (Lemma0. Therefore, Af(a) ^ 0. The feasible set of 
LP(fl) is all points of P x Q' with A = a. Further, function A(/3 T y) — e T p — f T q achieves maximum 
only at points on Af (Lemma0. Therefore, OPT {a) = Af(a) follows. □ 

3 Degeneracy may be handled using standard techniques as done in (T|. 
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Lemma 5. The set Af forms an infinite path, with A being monotonic on it. 

Proof. To the contrary suppose there are cycles and multiple paths in Af. Let C be a cycle in Af. It is 
easy to see that Af(a) = intersection of Af with the hyper-plane A = a. Therefore, 3a G EL such that 
either C is contained in A = a or it cuts the cycle at exactly two points. This contradicts that Af(a) is 
a convex set in both the cases (Lemma H). 

Now let V\ and V 2 be two paths in Af. Since, Af{a) is a convex set Va G R, A is monotonic on 
both the paths. Suppose, the range of A covered by V\ and V 2 be (—00, a] and (a, inf). However, this 
contradicts the fact that V 2 is a closed set. Monotonicity of A follows from the convexity of Af(a) . □ 

3.1 Algorithm 

Let {A, B) be a given rank-1 game and A + B = 7 • B T . Let y min = min xeX EieSj 7i x i and Imax = 
max ie x Ei'eS! li x i- The 7 m;n and jmax exists since X is a bounded polytope. From Lemma|2]it is clear 
that every point in the intersection of the set TV and hyper-plane H 7 : A — £; 6 g = corresponds 
to a NE of the given game (A,B). Note that for any point in the intersection, corresponding A is 
between j m j n and jmax- Let H~ and H+ be the negative and positive half spaces of the hyper-plane 
H 7 respectively, then clearly Af(j m i n ) G H~ and Af^max) G H+. All the points in the intersection 
of Af and H 7 are between Af(j m i„) and Af(j max ). The following algorithm does binary search on A/" 
between A/"(7 m/f! ) and Af^max) to find a point in the intersection using the fact that A monotonically 
increases (similar to the algorithm in [1]). 

5 1 Initialize a 1 = j min and a 2 = j max - 

52 If the edge containing Af(ai) or Af{a2) intersects H 7 , then output the intersection and exit. 

5 3 Let a = Let u/u be the edge containing Af{a) . 

Si If u, v intersects H 7 , then output the intersection and exit. 

S5 Else if u, v G H 7 , then set a\ = a else set a 2 = a and continue from step S3. 

Correctness. Since the feasible set of LP(a) is a section of P x Q', where A = a, the OPT(a) is on 
an edge of P x Q' (assuming non-degeneracy of LP (a)). It is easy to construct this edge from the 
tight equations of P x Q' at OPT (a). Clearly, this entire edge should be part of the set Af, hence if 
this edge intersects the hyper-plane H 7 then we get a Nash equilibrium of the given game. Since, all 
the points in the intersection of Af and H 7 are between A/"(7mm) and Af^max), and A monotonically 
increases between these two (Lemma [5]), the algorithm does a simple binary search between 7„„>, 
and jnmx to find an a, such that the edge containing Af(a) = OPT(a) intersects H 7 (LemmaHJ. 

Time Complexity. Recall that C is the bit length of the input game. Since, 7 mi „ and 7 mflX are op- 
timal points of two LPs on set X = {Ex = e,x > 0}, they can be represented in poly(C, M, N) bits. 
Let Z = max{|A|, \E\, \F\, \e\, \f\, \B\}, I = M + N + h + k 2 + 1, and A = l\Z l . 

Theorem 1. The above algorithm finds a NE of game (A, B) in time poly(C, M, N). 

Proof. One round of steps S3 to S5 can be done in polynomial time since computation of Af(a) re- 
quires solving LP(a) (Lemma HI, and computation of u, v n H 7 requires checking the feasibility of 
a polytope. Now, to show polynomial time complexity, we need to bound the number of rounds of 
steps S3 to S5. 

Note that the denominator of any co-ordinate of a vertex of P x Q' is at most A, and if A is not 
constant on an edge of Af, then the difference in its value between the two end points of the edge is 
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at least ■^ z . Therefore, if «2 — "1 < ^ the algorithm terminates. After k rounds a^ — a\ = 7wa * t ' m '" . In 
round k if a% — a\ = 7 """ 2t 7 ""'" > then A: < log(7 mflX — 7 m/n ) + 2 log A. Therefore, the algorithm is 
guaranteed to terminate after log(7 mflX — j m i„) + 2 log A + 1 = poly(C, M, N) many rounds. □ 

4 FPTAS for Rank-fc Games 

In this section, we discuss fully polynomial time approximation schemes for fixed rank games (i.e., 
rank(A + B) is constant). The approximation notion in bilinear games can be defined in a similar way 
to that of bimatrix games given by Kannan et al. [17|. Let x max = max re x %i, Vmax = rnax ye y Y,j Vj 
and D = \A + B\. Clearly the total payoff derived from a strategy profile (x, y) G X x Y is at most 
XmaxDymax. Using this we define an e-approximate NE for a bilinear game (A, B) as follows. 

Definition 2. For a strategy profile (x, y) G X x Y, let u = max r ' G x x' T Ay and v = maxy G y x T By'. Then 
(x,y) is an e-approximate NE of the game (A, B) ifu + v — x T (A + B)y < e(x max Dy max ). 

For a bimatrix game x max Dy max = D, since x and y are probability distributions, which is com- 
patible with the definition of |17). Next we define a stronger notion of e-approximate NE called 
relative e-approximate NE, where the error is relative to the maximum achievable payoff from the 
given strategy. 

Definition 3. For a strategy profile (x,y) G X x Y,let u = max^ e x x lT Ay and v = maxygy x T By'. Then 
(x,y) is a relative e-approximate NE of the game (A, B) ifu + v — x T (A + B)y < e(u + v), i.e., the total 
error is relatively small. 

Since the value of u + v is at most x max Dy max , if (x,y) is relative e-approximate NE, then it is 
also e-approximate NE. For all the examples mentioned in Section IZTl a (relative) approximate NE 
of the bilinear game formulation can be straightforwardly turned into a (relative) approximate NE 
of the corresponding finite game under standard definitions. Without loss of generality we assume 
that A, B, E, F, e and / are integer matrices, since scaling them by a positive value does not change 
the set of (relative) e-approximate NE. Next we discuss two FPTAS to solve QP of (O, one for each 
definition of approximation. The approaches used in these algorithms are generalization of the ones 
inHZl. 

4.1 FPTAS for Approximate NE 

We show that the result by Vavasis p8| can be applied to get an e-approximate Nash equilibrium 
(Definition |2]). The following proposition states the result by Vavasis. 

Proposition 4. Let min{^x T Qx + q T x : Ax < b} be a quadratic optimization problem with compact 
polytope {x G R" : Ax < b}, and let the rank ofQ be a fixed constant. Ifx* and x # denote points minimizing 
and maximizing the objective function f(x) = ^x T Qx + q T x in the feasible region, respectively, then one can 
find in time poly{C, ^) a point x^ satisfying 

f(x«)-f(x*)<e(f(x # )-f(x*)). 

Now consider the following QP formulation of (0, which captures all the NE of (A, B) at its optimal. 

min: e T p + f T q — x T (A + B)y 
s.t. Ay -E T p < 0; Fy = f; y > 
x T B - q T F < 0; Ex = e; x>0 
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Theorem 2. Let (A, B) be a rank-k game, then for every e > 0, an e-approximate Nash equilibrium can be 
computed in time poly(C, ~), where C is the bit length of the game and k is a constant. 

Proof. The objective function of the above QP can be easily transformed to the standard QP form 
jX T Qx + q T x, where rank(Q) = 2k. To apply Proposition|4]on this QP, we need to bound its feasible 
set. Since, {x : Ex = e, x > 0} and {y : Fy = f, y > 0} are compact, the only variables to bound are 
ps and qs. Since, the maximum possible value of x T (A + B)y for any (x,y) G X x Y is x max Dy max , 
the value of e T p + f T q is at most x max Dy max at any point of the polytope corresponding to NE (by 
(2}). Therefore, we impose e T p + f T q < x max Dy max . However, this may not bound the ps and ^s. 

LetZ = max{|A|, \B\, \E\, \F\, \e\, |/|} and / = M + N + k x +k 2 . Recall that NE of anon-degenerate 
game correspond to vertices of the polytope. It is easy to see that maximum absolute value of a co- 
ordinate of any vertex in the polytope is at most HZ 1 . Further, the quantity l\Z l can be represented in 
poly(C) bits. Therefore, imposing —HZ 1 < p < HZ 1 and —HZ 1 < q < HZ 1 in the above QP incur only 
a polynomial increase in its representation and does not change its optimal set. The minimum and 
the maximum objective values of this QP are zero (Lemma [3]) and at most 2x max Dy max respectively. 
Let ((y°, p°), (x°, q )) be the solution given by Vavasis algorithm for |, then from Proposition H] we 
get, 

eV + /V - x« T {A + B)y <> < e(x max Dy max ) 

From the primal-dual formulation of (D it is clear that max x / GX x' T Ay° < e T p <> and maxygy x^By' < 
f T q°. Therefore, we get max^ e x x' T Ay* + max y r G Y x<>T By' — x <>T (A + B)y <> < e(x max Dy max ). □ 

4.2 FPTAS for Relative Approximate NE 

Let the rank of a game (A, B) be k, then A + B = Ya =1 cc(i)B(i) 1 , where V/, a(i) G R M and B(i) G E N . 
We assume that the game is such that a(i)s and B(i)s are positive vectors. For all i < k, let if, = 
mm xex* T #(0 and w'j = max xe x^ r *(0' similarly let z,- = min 1/e y B(i) T y and z- = maxy 6 y B(i) T y. 
Note that w\, w\, z, and z • can be represented by poly(C, M, N) bits, since X and Y are compact. Given 
an e > 0, consider the sub-intervals [w ir (1 + e)Wj\, [(1 + e)wi, (1 + e) 2 iVj] of \W{, w'^\ and similarly of 
[z,, z-] . All combinations of these intervals form a grid in 2/c-dimensional box B = x [iVj, wj] x , [z„ zj] . 
Let (x,y) G X x Y be such that V/, x T a(z') G [uj, (1 + e)u,-] and B(i) T y G [y,-, (1 + e)Vj\, then clearly, 

£ M,^ < x T (A + B)y < (1 + e) 2 u^j (9) 

!=1 ;=1 

For a fixed hyper-cube of the grid, consider the following LP based on the QP of (0 

min: e T p + f T q 
s.t. Ay < E T p, Fy = f, y > 
x T B < q T F; Ex = e; x > 

u t < x T oc(i) < (1 + e)ui) Vj < B(i) T y < (l + e)v ir V/ 

Algorithm. Run the above LP for each hyper-cube of the grid, and output an optimal point of the one 
giving the best approximation. As the number of hyper-cubes in the grid is poly(C,l/ log(l + e)), 
the running time of the algorithm is poly(M, N, C, 1/ log(l + e)). 

Correctness. Next we show that the above algorithm gives ^1 — ^j^i j -approximate NE of the 
game (A, £>). Let (x',y') be a NE of the given game, and {p',q r ) be such that x' T Ay 1 = e T p' and 
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x' T By' = f T q'. Consider the hyper-cube containing (x' T a(l),. . . ,x' T a(k),B(l) T y', . . .,B(k) T y') of 
the grid and corresponding LP. Clearly, (x',y' r p',q') is a feasible point of this LP and Yh=i u,-fj < 
e T p' + f T q' < (1 + e) 1 E{=i u i v i> since e T p' + = x' T (A + B)y'. Therefore, at the optimal point 
(x, y,f,q) of the LP we get e T p + f T q < (1 + e) 1 u i v i, an d this gives, 

x T (A + B)y > Eii > (using (0)) 

e T p + fq - x T ( A + B)y < (l - (e T p + fq) 

Let ^ = ^1 — j^^j, u = max-ygxT 1 Ay, and 5 = max 7G Y ^ T f>7- Clearly, e T p > u and fq > v 
(using <Q}). Let D = e T p + -u-v, then + - f T (A + B)y = e T p + f T q — D — x T (A + B)y < 
f/(e T p + / T ^) — D < p(e T p + f T q — D) = p(u + u), since p G (0, 1). Therefore, (x,y) is a relative 
^-approximate NE of the given game (A, B) (Definition [3). 

Theorem 3. Lef (A, B) &e a rank-k game, and A + B = a(i)f}(i) T , such that ec(i)s and B(j)s are 
positive vectors. Then given an e > 0, a relative ^1 — ^j^p ) -approximate NE can be computed in time 
poly(C, 1/ log(l + e)), w/iere C is the input bit length. □ 

For a symmetric game (B = A T , E = F,e = f), an (relative) e-approximate symmetric NE can 
be defined as an (relative) e-approximate NE with the same strategies, i.e., x = y. It is easy to check 
that, if we use the QP formulation of (© instead of <(5]) in any of these algorithms, then the output 
strategy is an (relative) e-approximate symmetric NE strategy. 



5 Games with a Low Rank Matrix 

In this section we show that if rank of even one payoff matrix (A or B) is constant, then Nash equi- 
librium computation can be done in polynomial time. Recall the best response polytopes P and Q 
(0 for the bilinear game (A, B). 

Lemma 6. Given a game (A, B), there exists a vertex pair ((y, p), (x,q)) G P x Q such that (x,y) is a NE 
of(A,B). 

Proof. All the solutions of (SJl over P x Q are the NE of the game, and existence of a solution is 
guaranteed (Proposition [l). Suppose v G P x Q is a NE of (A, B). It is easy to check that the entire 
face, formed by the set of tight equations at v, is solution, and it contains at least one vertex, as P x Q 
is bounded from one side. □ 

Lemma 7. Eet k\=ki = k and rank(A) = I. The polytope P has at most 0(N ,+k ) vertices. 

Proof. From 10, it is clear that P is in (N + k) -dimensional Euclidean space, however Fy = / gives 
k linearly independent equalities. Therefore, P is of dimension N, and at a vertex of P, N linearly 
independent inequalities must be tight. Since A is of rank I, rank([A -E]) < I + k. Therefore, 3!S C 
Si, |S| = l + k such that Vz G Si \ S, A{y — p T E l < are not needed in defining the polytope P. At 
a vertex, if d inequalities are tight from S then rest N — d must be of type y; = 0, hence for a fixed 
D C S, \D\ = d, there are at most ( N N rf ) = (^) choices to form a vertex. Therefore, the total number 
of vertices are at most ji±* ('+*) ( N { ) < 2 l+k N l+k . D 

Note that if we remove the assumption k\ = ki = k then the exponent of N turns out to be a 
linear function of I, k\ and /C2. A similar proof can be worked out for Q. 
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Theorem 4. If rank of either A or B is constant then a Nash equilibrium of a bilinear game (A, B) can be 
computed in polynomial time, assuming k to be a constant. 

Proof. Suppose rank(A) = I (a constant) and v = (y, p) G P be a vertex. We can check in polynomial 
time whether v corresponds to a NE or not as follows. Let S x = {i G Si | A\y — p T E' = 0} and 
S y = {; G S 2 | yj > 0}. Consider all (x, q) G R M+k such that 

_ V; G S y , x T B' - q T P = 0; Vz G S. T/ x, > 
~ g; V; & S y , x T Bi - q T P < 0; Vz & S x , x, = 0. 

Every such (x, q) lies in Q and makes a fully-labeled pair with v, and hence forms a NE (Lemma|l). 
Note that such an (x, q) can be obtained in polynomial time by solving an LP. Now the proof follows 
from Lemmas |6] and A similar argument proves the other case when rank(B) is constant. □ 

As the set of bimatrix games is a subclass of the bilinear games (Example H), where k\ = k% = 1, 
Theorem H] strengthens the results by Lipton, Markakis and Mehta )l22| (Corollary 4), and Kannan 
and Theobald [17] (Theorem 3.2), where they require that the rank of both A and B to be constants. 
Note that k in Bayesian games depends on the number of types of players and in the sequence form, 
it depends on the number of information sets of players. Therefore, this result can be applied to 
these games if in their bilinear representation, a payoff matrix has low rank and k is constant. 

In fact Theorem H] gives a polynomial time algorithm to enumerate all the extreme equilibria of a 
bilinear game with a constant rank matrix, and an exponential time enumeration algorithm for any 
bilinear game. A similar (exponential time) algorithm was given by Avis et al. (21 to enumerate all 
Nash equilibria of a bimatrix game. 

6 Conclusion 

We have defined two-player bilinear games, where payoffs are represented by two matrices (A,B) 
and strategy sets are compact polytopes. In both bilinear and bimatrix games, the utilities are bi- 
linear functions of strategy vectors. The scope of these games is large enough to capture many in- 
teresting classes of games like bimatrix games, two-player Bayesian games, polymatrix games, and 
two-player extensive-form games with perfect recall. Considering the rank-based hierarchy puts 
a structure on bilinear games, and by exploiting this structure and the similarity between bilin- 
ear and bimatrix games, we extended various combinatorial and algorithmic results, pertaining to 
the efficient computation of Nash equilibria, from bimatrix games to bilinear games. It will be in- 
teresting to know what other results of bimatrix games extend to bilinear games like 1) designing 
Lemke-Howson type algorithm for NE computation, 2) extending other algorithms for computation 
of approximate equilibria, etc. 
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